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Multidatabase s ystems: Query optimization in multidatabase systems 

Qiang Zhu 

November 1992 Proceedings of the 1992 conference of the Centre for Advanced Studies 

on Collaborative research - Volume 2 

Full text available: ^| pdf(1.03 MB) Additional Information: full citation , abstract , references , citings 

A multidatabase system (MDBS) integrates information from autonomous local databases 
managed by heterogeneous database management systems (DBMS) in a distributed 
environment. For a query involving more than one database, global query optimization 
should be performed to achieve good overall system performance. The significant 
differences between an MDBS and a traditional distributed database system (DDBS) make 
query optimization in the former more challenging than in the latter. Challenges for qu ... 

Keywords: adaptive query optimization, distributed query optimization, multidatabase 
system, parametric query optimization, probing query, query processing, semantic query 
optimization 



Lo g ic-based approach to semantic q uery o ptimization 

Upen S. Chakravarthy, John Grant, Jack Minker 

June 1990 ACM Transactions on Database Systems (TODS), volume 15 issue 2 

Additional Information: full citation , abstract , references , citin gs, index 

terms 

The purpose of semantic query optimization is to use semantic knowledge (e.g., integrity 
constraints) for transforming a query into a form that may be answered more efficiently 
than the original version. In several previous papers we described and proved the 
correctness of a method for semantic query optimization in deductive databases couched in 
first-order logic. This paper consolidates the major results of these papers emphasizing the 
techniques and their applicability for optimizing rel ... 




Parametric query optimization 

Yannis E. Ioannidis, Raymond T. Ng, Kyuseok Shim, Timos K. Sellis 

May 1997 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 6 Issue 2 

Full text available: fSl pdf (378.53 KB ) Additional Information: full citation , abstract , citings, index terms 



In most database systems, the values of many important run-time parameters of the 
system, the data, or the query are unknown at query optimization time. Parametric query 
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optimization attempts to identify at compile time several execution plans, each one of which 
is optimal for a subset of all possible values of the run-time parameters. The goal is that at 
run time, when the actual parameter values are known, the appropriate plan should be 
identifiable with essentially no overhead. We present a g ... 

4 Description logics for semantic query optimization in ob j ect-oriented dat aba se sy stems 

Domenico Beneventano, Sonia Bergamaschi, Claudio Sartori 

March 2003 ACM Transactions on Database Systems (TODS), volume 28 issue l 

Full text available: pdf(406.56 KB) Additional Information: full citation , abstract , references , index terms 

Semantic query optimization uses semantic knowledge (i.e., integrity constraints) to 
transform a query into an equivalent one that may be answered more efficiently. This 
article proposes a general method for semantic query optimization in the framework of 
Object-Oriented Database Systems. The method is effective for a large class of queries, 
including conjunctive recursive queries expressed with regular path expressions and is 
based on three ingredients. The first is a Description Logic, ODL 

Keywords: Semantic query optimization, description logics, integrity constraints rules, 
query rewriting method, semantic expansion of a query, subsumption 



5 Query Optimization in Database Systems 

Matthias Jarke, Jurgen Koch 

June 1984 ACM Computing Surveys (CSUR), volume 16 issue 2 

Full text available: f£l pdf(2.84 MB) Additional Information: full citation , references , citings , index terms 



6 A method for automatic rule derivation to support semantic query optimization 

Michael Siegel, Edward Sciore, Sharon Salveter 

December 1992 ACM Transactions on Database Systems (TODS), volume 17 issue 4 

Full text available- fS pdf(2.73 MB) Additional Information: full citation , abstract , references , citings , index 

terms 

The use of inference rules to support intelligent data processing is an increasingly important 
tool in many areas of computer science. In database systems, rules are used in semantic 
query optimization as a method for reducing query processing costs. The savings is 
dependent on the ability of experts to supply a set of useful rules and the ability of the 
optimizer to quickly find the appropriate transformations generated by these rules. 
Unfortunately, the most useful rules are not always th ... 

Keywords: integrity constraint, learning, transformation heuristic 




7 Efficient and extensible algorithms for multi query optimization 

Prasan Roy, S. Seshadri, S. Sudarshan, Siddhesh Bhobe 

May 2000 ACM SIGMOD Record , Proceedings of the 2000 ACM SIGMOD international 
conference on Management of data, volume 29 issue 2 

Full text available: 111 pdf(1 97 59 KB) Additional Information: full citation, abstract , references , citings, index 

™ t erms 

Complex queries are becoming commonplace, with the growing use of decision support 
systems. These complex queries often have a lot of common sub-expressions, either within 
a single query, or across multiple such queries run as a batch. Multiquery optimization aims 
at exploiting common sub-expressions to reduce evaluation cost. Multi-query optimization 
has hither-to been viewed as impractical, since earlier algorithms were exhaustive, and 
explore a doubly exponential search space. 
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8 Query optimization on local area networks 

Alan R. Hevner, 0. Q. Wu, S. B. Yao 

January 1985 ACM Transactions on Information Systems (TOIS), volume 3 issue l 

Full text available* TO pdf(1.75 MB) Additional Information: full citation , abstract , references , citings , index 

• terms, review 

Local area networks are becoming widely used as the database communication framework 
for sophisticated information systems. Databases can be distributed among stations on a 
network to achieve the advantages of performance, reliability, availability, and modularity. 
Efficient distributed query optimization algorithms are presented here for two types of local 
area networks: address ring networks and broadcast networks. Optimal algorithms are ... 

9 Stochastic query o ptimization in distributed databases 

P. E. Drenick, E. J. Smith 

June 1993 ACM Transactions on Database Systems (TODS), volume 18 issue 2 

Full text available* *0 odfd 64 MB) Additional Information: full citation , abstra ct, references, inde^erms, 

' 1 review 

Many algorithms have been devised for minimizing the costs associated with obtaining the 
answer to a single, isolated query in a distributed database system. However, if more than 
one query may be processed by the system at the same time and if the arrival times of the 
queries are unknown, the determination of optimal query-processing strategies becomes a 
stochastic optimization problem. In order to cope with such problems, a theoretical state- 
transition model is presented that treats the s ... 

Keywords: distributed query processing, state-transition model, stochastic query 
optimization 





10 Iterative dynamic programming: a new class of query optimization algorithms 
Donald Kossmann, Konrad Stocker 

March 2000 ACM Transactions on Database Systems (TODS), volume 25 issue i 

Full text available* fg| pdf(955 89 KB) Additional information: full citation , abstract , references , citings , index 

■ : terms, review 

The query optimizer is one of the most important components of a database system. Most 
commercial query optimizers today are based on a dynamic-programming algorithm, as 
proposed in Selinger et al. [1979]. While this algorithm produces good optimization results 
(i.e, good plans), its high complexity can be prohibitive if complex queries need to be 
processed, new query execution techniques need to be integrated, or in certain 
programming environments (e.g., distributed database systems). In ... 

Keywords: dynamic programming, greedy algorithm, iterative dynamic programming, plan 
evaluation function, query optimization, randomized optimization 



1 1 OPT++ : an object-oriented implementation for extensible database query optimization 

Navin Kabra, David J. DeWitt 

April 1999 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 8 Issue 1 

Full text available: ^ pdfd 67.26 KB) Additional Information: full citation , abstract , citings, index terms 

In this paper we describe the design and implementation of OPT++, a tool for extensible 
database query optimization that uses an object-oriented design to simplify the task of 
implementing, extending, and modifying an optimizer. Building an optimizer using OPT+ + 
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makes it easy to extend the query algebra (to add new query algebra operators and 
physical implementation algorithms to the system), easy to change the search space, and 
also to change the search strategy. Furthermore, OPT++ comes equipp ... 

Keywords: Extensibility, Object-relational databases, Query optimization, Software 
architecture 



12 Multiple-query optimization 

Timos K. Sellis 

March 1988 ACM Transactions on Database Systems (TODS), volume 13 issue l 

Full text available: l5ledf(2J9MB) Additional Information: full citation , abstract , references , citings , index 

[Aj-fc! terms , review 

Some recently proposed extensions to relational database systems, as well as to deductive 
database systems, require support for multiple-query processing. For example, in a 
database system enhanced with inference capabilities, a simple query involving a rule with 
multiple definitions may expand to more than one actual query that has to be run over the 
database. It is an interesting problem then to come up with algorithms that process these 
queries together instead of one query at a time. Th ... 

13 Cache investment: integrating query optimization and distributed data placement 

Donald Kossmann, Michael J. Franklin, Gerhard Drasch, Wig Ag 

December 2000 ACM Transactions on Database Systems (TODS), volume 25 issue 4 

Full text available: f£lpdf(210.67 KB) Additional Information: full citation , abstract , references , citings , index 

terms 

Emerging distributed query-processing systems support flexible execution strategies in 
which each query can be run using a combination of data shipping and query shipping. As in 
any distributed environment, these systems can obtain tremendous performance and 
availability benefits by employing dynamic data caching. When flexible execution and 
dynamic caching are combined, however, a circular dependency arises: Caching occurs as a 
by-product of query operator placement, but query operator pi ... 

Keywords: cache investment, caching, client-server database systems, data shipping, 
dynamic data placement, query optimization, query shipping 

1 4 Multidatabase systems: Query optimization using fuzzy set theory for multidatabase 
systems 

Qiang Zhu, P. A. Larson 

October 1993 Proceedings of the 1993 conference of the Centre for Advanced Studies 

on Collaborative research: distributed computing - Volume 2 

Full text available: 1*Q pdf(761.60 KB) Additional Information: full citation , abstract, references , citings 






One of the challenges for global query optimization in a multidatabase system (MDBS) is 
that some local optimization information may not be accurately known at the global level 
because of local autonomy. In this paper, we introduce a fuzzy query optimization approach 
that is based on fuzzy set theory, introduced by Zadeh in the 1960s, to tackle the 
challenge. We describe the problem of fuzzy query optimization and compare the fuzzy 
optimization approach with the traditional (crisp) optimization ... 

Keywords: fuzzy optimization, fuzzy set, multidatabase system, query optimization 
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Query o ptimization in a memory-resident domain relational calculus database system 
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Kyu-Young Whang, Ravi Krishnamurthy 

March 1990 ACM Transactions on Database Systems (TODS), volume is issue l 

Full text available: 1^1 pdf(2.46 MB ) Additional Information: full citation, abstract, references, cjtjngs, index 

. [ajp terms 

We present techniques for optimizing queries in memory-resident database systems. 
Optimization techniques in memory-resident database systems differ significantly from 
those in conventional disk-resident database systems. In this paper we address the 
following aspects of query optimization in such systems and present specific solutions for 
them: (1) a new approach to developing a CPU-intensive cost model; (2) new optimization 
strategies for main-memory query processing; (3) new insight into ... 

1 6 Quer y processin g : A characterization of the sensitivity of quer y optimization to storage 
access cost parameters 

Frederick R. Reiss, Tapas Kanungo 

June 2003 Proceedings of the 2003 ACM SIGMOD international conference on 
Management of data 

Full text available: ^| pdf(255.35 KB ) Additional Information: full citation , abstract , references , index terms 

Most relational query optimizers make use of information about the costs of accessing 
tuples and data structures on various storage devices. This information can at times be off 
by several orders of magnitude due to human error in configuration setup, sudden changes 
in load, or hardware failure. In this paper, we attempt to answer the following questions:* 
Are inaccurate access cost estimates likely to cause a typical query optimizer to choose a 
suboptimal query plan?* If an optimizer ... 

Keywords: autonomic computing, computational geometry, databases, parametric query 
optimization, storage systems 



17 A piggyback meth od t o coll ect statistics for query opti mizati on in d at abase 
mana g ement systems 

Qiang Zhu, Brian Dunkel, Nandit Soparkar, Suyun Chen, Berni Schiefer, Tony Lai 
November 1998 Proceedings of the 1998 conference of the Centre for Advanced Studies 

on Collaborative research 

Full text available: fiQ pdf(328.82 KB) Adc,itiona, Information: full citation , abstract , references, citings, index 

terms 

A database management system (DBMS) performs query optimization based on statistical 
information about data in the underlying data-base. Out-of-date statistics may lead to 
inefficient query processing in the system. Existing solutions to this problem have some 
drawbacks such as heavy administrative burden, high system load, and tardy updates. To 
overcome these drawbacks, our new approach, called the piggyback method, is proposed in 
this paper. The key idea is to piggyback some additional retriev ... 

Keywords: access method, cost estimation, database management system, piggyback 
analysis, query optimization, statistics collection 



18 On compile-time query optimization in deductive databases by means of static filtering 

Michael Kifer, Eliezer L. Lozinskii 

September 1990 ACM Transactions on Database Systems (TODS), volume is issue 3 

Full text available: f g| pdf(3.49 MB) Additional Information: full citation , abstract , references , citings , index 

^ terms , review 

We extend the query optimization techniques known as algebric manipulations with 
relational expressions [48] to work with deductive databases. In particular, we propose a 
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method for moving data-independent selections and projections into recursive axioms, 
which extends all other known techniques for performing that task [2, 3, 9, 18, 20]. We 
also show that, in a well-defined sense, our algorithm is optimal among the algorithms that 
propagate data-independent selections through recursion. 



Keywords: dataflow, deductive databases, filtering, fixpoint, graph representation, 
inference, projection, recursive rules, selection 



19 An overview of query optimization in relational systems 

Surajit Chaudhuri 

May 1998 Proceedings of the seventeenth ACM SIGACT-SIGMOD-SIGART symposium 
on Principles of database systems 

Full text available: 1f| pdf(1.42 MB) Additional Information: full citation, references , citings, index terms 




20 Special issue in parallelism in database systems: Considering data skew factor in 
multi-way join query o ptimization for parallel execution 

Kien A. Hua, Yo Lung Lo, Honesty C. Young 

July 1993 The VLDB Journal — The International Journal on Very Large Data Bases, 

Volume 2 Issue 3 

Full text available: ^pdf(1-43 MB) Additional Information: full citation , abstract , references , citings 

A consensus on parallel architecture for very large database management has emerged. 
This architecture is based on a shared-nothing hardware organization. The computation 
model is very sensitive to skew in tuple distribution, however. Recently, several parallel join 
algorithms with dynamic load balancing capabilities have been proposed to address this 
issue, but none of them consider multi-way join problems. In this article we propose a 
dynamic load balancing technique for multi-way joins, and i ... 

Keywords: load balancing, multi-way join, parallel-database computer, query optimization 
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